Non-parametric k-sample tests: Density functions vs distribution functions

نویسندگان

  • Pablo Martínez-Camblor
  • Jacobo de Uña-Álvarez
چکیده

In this paper we introduce some tests for the comparison of k samples based on kernel density estimators (KDE), and we develope the Double Minimum method as a new and useful procedure for the crucial problem of bandwidth selection. We study, via Monte Carlo simulations, the statistical power of the proposed tests, as well as the impact of the smoothing degree and the performance of the Double Minimum algorithm. Finally, we compare the results of the tests based on the KDE to those of the traditional k-sample tests based on empirical distribution functions (EDF), and to other tests based on the likelihood ratio introduced in the recent literature. Two main conclusions are obtained. First, the proposed bandwidth selection method attain quasi-optimal results. Second, the simulations suggest that KDE-based tests are the most powerful when the underlying populations are different in shape.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robustness to non-normality of common tests for the many-sample location problem

Abstract. This paper studies the effect of deviating from the normal distribution assumption when considering the power of two many-sample location test procedures: ANOVA (parametric) and Kruskal-Wallis (non-parametric). Power functions for these tests under various conditions are produced using simulation, where the simulated data are produced using MacGillivray and Cannon’s [10] recently sugg...

متن کامل

Nonparametric Transition-Based Tests for Jump Diffusions

We develop a specification test for the transition density of a discretely sampled continuous-time jump-diffusion process, based on a comparison of a nonparametric estimate of the transition density or distribution function with their corresponding parametric counterparts assumed by the null hypothesis. As a special case, our method applies to pure diffusions. We provide a direct comparison of ...

متن کامل

Optimal power flow based on gray wolf optimization algorithm using probability density functions extraction considering wind power uncertainty

In recent years, utilization of the renewable based power plants has become widespread in the power systems. One of the most widely used renewable based power plants is wind power plants. Due to the utilization of wind energy to generate electricity, wind turbines have not emitted any environmental pollution. Thus, in addition to economic benefits, utilization of these power plants is of great ...

متن کامل

Height and Crown Area Distribution of Cionura erecta Shrub lands in chaharmahal and Bakhtiari Province, Using Probability Distribution Functions

Importance of probability distribution functions in natural resource studies is increasing due to their effective roles in better understanding of vegetation structure and providing conceptual models of quantitative indices of plant species. The present study was performed to model the distribution of height and canopy area of Cionura erecta L. shrub, using probability distribution functions in...

متن کامل

Topics in kernel hypothesis testing

This thesis investigates some unaddressed problems in kernel nonparametrichypothesis testing. The contributions are grouped around three main themes:Wild Bootstrap for Degenerate Kernel Tests. A wild bootstrap method for non-parametric hypothesis tests based on kernel distribution embeddings is pro-posed. This bootstrap method is used to construct provably consistent teststh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 53  شماره 

صفحات  -

تاریخ انتشار 2009